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Abstract 

This paper studies the properties of a new lower bound for the natural pseudo- 
distance. The natural pseudo-distance is a dissimilarity measure between shapes, 
where a shape is viewed as a topological space endowed with a real- valued continuous 
function. Measuring dissimilarity amounts to minimizing the change in the functions 
due to the application of homeomorphisms between topological spaces, with respect 
to the Loo-norm. In order to obtain the lower bound, a suitable metric between size 
functions, called matching distance, is introduced. It compares size functions by 
solving an optimal matching problem between countable point sets. The matching 
distance is shown to be resistant to perturbations, implying that it is always smaller 
than the natural pseudo-distance. We also prove that the lower bound so obtained 
is sharp and cannot be improved by any other distance between size functions. 
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1 Introduction 

Shape comparison is a fundamental problem in shape recognition, shape clas- 
sification and shape retrieval (cf., e.g., [24]), finding its applications mainly 
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in Computer Vision and Computer Graphics. The shape comparison prob- 
lem is often dealt with by defining a suitable distance providing a measure of 
dissimilarity between shapes (see, e.g., [25] for a review of the literature). 

Over the last fifteen years. Size Theory has been developed as a geometrical- 
topological theory for comparing shapes, each shape viewed as a topological 
space Ai, endowed with a real- valued continuous function (/? [15]. The pair 
{A4, If ) is called a size pair, while ip is said to be a measuring function. The 
role of the function (p is to take into account only the shape properties of the 
object described by A4 that are relevant to the shape comparison problem at 
hand, while disregarding the irrelevant ones, as well as to impose the desired 
invariance properties. 

A measure of the dissimilarity between two size pairs is given by the natural 
pseudo-distance. The main idea in the definition of natural pseudo-distance 
between size pairs is to minimize the change in the measuring functions due to 
the application of homeomorphisms between topological spaces, with respect 
to the Loo-norm: The natural pseudo-distance between {M,(p) and {M^ip) 
with and M homeomorphic is the number 

infmax|^(P)-^(M^))|, 

where h varies in the set H{M.,N) of all the homeomorphisms between M. 
and M . In other words, the variation of the shapes is modeled by the infinite- 
dimensional group of homeomorphisms and the cost of warping an object's 
shape into another is measured by the change of the measuring functions. An 
important feature of the natural pseudo-distance is that it does not require 
the choice of parametrizations for the spaces under study nor the choice of 
origins of coordinate, which in image applications would be arbitrarily driven. 

The main aim of this paper is to provide a new method to estimate the natural 
pseudo-distance, motivated by the intrinsic difficulty of a direct computation. 
Since the natural pseudo-distance is defined by a minimization process, it 
would be natural to look for the optimal transformation that takes one shape 
into the other, as usual in energy minimization methods. In our case, however, 
the existence of an optimal homeomorphism attaining the natural pseudo- 
distance is not guaranteed. 

Earlier results about the natural pseudo-distance can be divided in two classes. 
One class provides constraints on the possible values taken by the natural 
pseudo-distance between two size pairs. For example, if the considered topolog- 
ical spaces Ai and H are smooth closed manifolds and the measuring functions 
are also smooth, then the natural pseudo-distance is an integer sub-multiple of 
the Euclidean distance between two suitable critical values of the measuring 
functions [8]. In particular, this integer can only be either 1 or 2 in the case 
of curves, while it can be either 1, 2 or 3 in the case of surfaces [9]. The other 
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class of results furnishes lower boTinds for the natural pseudo-distance [7], [18]. 
In particular it is possible to estimate the natural pseudo-distance by using 
the concept of size function [7]. Indeed, size functions can reduce the com- 
parison of shapes to the comparison of certain countable subsets of the real 
plane (cf. [10], [13] and [16]). This reduction allows us to study the space of all 
homeomorphisms between the considered topological spaces, without actually 
computing them. The research on size functions has led to a formal setting, 
which has turned out to be useful, not only from a theoretical point of view, 
but also on the applicative side (see, e.g., [1], [3], [5], [6], [14], [19], [21]). 

This paper investigates into the problem of obtaining lower bounds for the 
natural pseudo-distance using size functions. 

To this aim, we first introduce the concept of reduced size function. Reduced 
size functions are a slightly modified version of size functions based on the 
connectedness relation instead of path-connectedness. This new definition is 
introduced in order to obtain both theoretical and computational advantages 
(see Rem. 3 and Rem. 9). However, the main properties of size functions are 
maintained. In particular, reduced size functions can be represented by sets of 
points of the (extended) real plane, called cornerpoints. 

Then we need a preliminary result about reduced size functions (Th. 25). It 
states that a suitable distance between reduced size functions exists, which 
is continuous with respect to the measuring functions (in the sense of the 
Loo-topology). We call this distance matching distance, since the underlying 
idea is to measure the cost of matching the two sets of cornerpoints describing 
the reduced size functions. The matching distance reduces to the bottleneck 
distance used in [4] for comparing Persistent Homology Groups when the mea- 
suring functions are taken in the subset of tame functions. We underline that 
the continuity of the matching distance implies a property of perturbation 
robustness for size functions allowing them to be used in real applications. 

Having proven this, we are ready to obtain our main results. Indeed, the sta- 
bility of the matching distance allows us to prove a sharp lower bound for the 
change of measuring functions under the action of homeomorphisms between 
topological spaces, i.e. for the natural pseudo-distance (Th. 29). Furthermore, 
we prove that the lower bound obtained using the matching distance not only 
improves the previous known lower bound stated in [7], but is the best possible 
lower bound for the natural pseudo-distance obtainable using size functions. 
The proof of these facts is based on Lemma 30. This lemma is a crucial re- 
sult stating that it is always possible to construct two suitable measuring 
functions on a topological 2-sphere with given reduced size functions and a 
pseudo-distance equaling their matching distance. On the basis of this lemma, 
in Th. 32 and Th. 34 we prove that the matching distance we are considering 
is, in two different ways, the best metric to compare reduced size functions. 
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This paper is organized as follows. In Section 2 we introduce the concept of 
reduced size function and its main properties. In Section 3 the definition of 
matching distance between reduced size functions is given. In Section 4 the 
stability theorem is proved, together with some other useful results. The con- 
nection with natural pseudo-distances between size pairs is shown in Section 5, 
together with the proof of the existence of an optimal matching between re- 
duced size functions. Section 6 contains the proof that it is always possible 
to construct two size pairs with pre-assigned reduced size functions and a 
pseudo-distance equaling their matching distance. This result is used in Sec- 
tion 7 to conclude that the matching distance furnishes the finest lower bound 
for the natural pseudo-distance between size pairs among the lower bounds 
obtainable through reduced size functions. In Section 8 our results are briefiy 
discussed. 



2 Reduced size functions 

In this section we introduce reduced size functions, that is, a notion derived 
from size functions ([15]) allowing for a simplified treatment of the theory. The 
definition of reduced size function differs from that of size function in that it 
is based on the relation of connectedness rather than on path-connectedness. 
The motivation for this change, as explained in Remark 3, has to do with the 
right-continuity of size functions. 

In what follows, A4 denotes a non-empty compact connected and locally con- 
nected Hausdorff space, representing the object whose shape is under study. 

The assumption on the connectedness of Ai can easily be weakened to any 
finite number of connected components without much affecting the follow- 
ing results. More serious problems would derive from considering an infinite 
number of connected components. 

We shall call any pair {Ai.Lp), where (/? : ^ R is a continuous function, 
a size pair. The function ip is said to be a measuring function. The role of 
the function (p is to take into account only the shape properties of the ob- 
ject described by Ai that are relevant to the shape comparison problem at 
hand, while disregarding the irrelevant ones, as well as to impose the desired 
invariance properties. 

Assume a size pair {A4,(p) is given. For every x e R, let A4{(p < x) denote 
the lower level set {P e M : <p{P) < x}. 

Definition 1 For every real number y, we shall say that two points P,QeM. 
are {(p < |/)-connected if and only if a connected subset C of A4{(p < y) exists, 
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(a) 



(b) 



Fig. 1. (b) The reduced size function of the size pair {M, ip), where M. is the curve 
represented by a continuous hne in (a), and (p is the function "distance from the 
point P". 

containing both P and Q. 

In the following, A denotes the diagonal {{x,y) e M."^ : x — y}; denotes 
the open half- plane {{x,y) e M.'^ : x < y} above the diagonal; denotes the 
closed half-plane {(x, y) e : x < y} above the diagonal. 

Definition 2 (Reduced size function) We shall call reduced size function as- 
sociated with the size pair {A4,(p) the function (^*M,ip) '■ ~^ ^> defined by 
setting ip)(^'y) ^Q'^'^^ t'^ the number of equivalence classes into which the 
set M.{(f < x) is divided by the relation of {ip < y) -connectedness. 

In other words, ip)i-'^^y) counts the number of connected components in 
M.{(p < y) that contain at least one point of A^(</? < x). The finiteness of 
this number is an easily obtainable consequence of the compactness and local- 
connectedness of Jv[. 

An example of reduced size function is illustrated in Fig. 1. In this example we 
consider the size pair {M.^ p), where Ai is the curve represented by a continu- 
ous line in Fig. 1 (a), and (/? is the function "distance from the point P". The 
reduced size function associated with {J\4, (p) is shown in Fig. 1 (b). Here, the 
domain of the reduced size function is divided by solid lines, representing the 
discontinuity points of the reduced size function. These discontinuity points 
divide A"*" into regions in which the reduced size function is constant. The 
value displayed in each region is the value taken by the reduced size function 
in that region. 

For instance, for a < x < 6, the set {P e M. : <p{P) < x} has two connected 
components which arc contained in different connected components of {P G 
A4 : ipiP) < y} when x < y < b. Therefore, ^p){x,y) = '2 for a < x < b 
and X < y < b. When a < x < b and y > b, all the connected components 
oi {P E M : ^{P) < x} are contained in the same connected component 



5 



of {P E M : (f{P) < y}. Therefore, t^M,<p)(^'y) = 1 for a < x < 6 and 
y > b. When b < x < c and y > c, all of the three connected components of 
{P G Ai : V'(P) < x} belong to the same connected component of {P G A4 : 
viP) < y}, implying that in this case i*M,v)^^^y) = 1- 

As for the values taken on the discontinuity lines, they are easily obtained by 
observing that reduced size functions are right- continuous, both in the variable 
X and in the variable y. 

Remark 3 The property of right-continuity in the variable x can easily be 
checked and holds for classical size functions as well. The analogous property 
for the variable y is not immediate, and in general does not hold for classical 
size functions, if not under stronger assumptions, such as, for instance, that 
is a smooth manifold and the measuring function is Morse (cf. Cor. 2.1 in 
[12]). Indeed, the relation of {(p < ?/)-homotopy, used to define classical size 
functions, does not pass to the limit. On the contrary, the relation of ((/? < y)- 
connectedness does, that is to say, if, for every e > it holds that P and Q are 
(95 < 2/ + e)-connected, then they are {(f < y)-connected. To see this, observe 
that connected components are closed sets, and the intersection fl? of a 
family of compact, connected Hausdorff subspaces Ki of a topological space, 
with the property that K^^i C Kj, for every i, is connected (cf. Th. 28.2 in 
[26] p. 203). 

Most properties of classical size functions continue to hold for reduced size 
functions. For the aims of this paper, it is important that for reduced size func- 
tions it is possible to define an analog of classical size functions' cornerpoints 
and cornerlines, here respectively called proper cornerpoints and cornerpoints 
at infinity. The main reference here is [16]. 

Definition 4 (Proper cornerpoint) For every point p = {x,y) G A+, let us 
define the number n{p) as the minimum over all the positive real numbers e, 
with X + e < y — e, of 

^*{M,^) {x + ^,y-^)-^{M,ip)i^-^^y-^)-^{M,ip)i^ + ^^y + ^)+ ^{M,ip) {x-e,y + e). 

The finite number ii{p) will be called multiplicity of p for (-\m,^p)- Moreover, 
we shall call proper cornerpoint for ('\m,(p) ^''^V PO'int p G A"*" such that the 
number ii{p) is strictly positive. 

Definition 5 (Cornerpoint at infinity) For every vertical line r, with equation 

X = k, let us identify r with the pair {k,oo), and define the number yu(r) as 
the minimum, over all the positive real numbers e with k + e < 1/e, of 

^*M,^){k + e, - £lM,^){k - e, 1/e). 

When this finite number, ca//ed multiplicity of r for is strictly positive, 

we call the line r a cornerpoint at infinity for the reduced size function. 
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Fig. 2. Cornerpoints of a reduced size function: in this example, p, q and r are the 
only proper cornerpoints, and have multiplicity equal to 2 (p) and 1 {r,q). The 
point s is not a cornerpoint, since its multiplicity vanishes. The line m is the only 
cornerpoint at infinity. 

Remark 6 The multiplicity of points and of vertical lines is always non neg- 
ative. This follows from an analog of Lemma 1 in [16], based on counting the 
equivalence classes in the set 

{PeM{<p< X2) e M{(p < xi) s.t. P ^^<y, Q} 

quotiented by the relation of {(p < |/i)-connectedness, in order to obtain the 
number 

^(M¥')(^2,yi) -•^(yw,<^)(a;i,yi) 

when xi < X2 < yi- 

Remark 7 Under our assumptions on M., i.e. its connectedness, /x(r) can only 
take the values and 1, but the definition can easily be extended to spaces 
with any finite number of connected components, so that //(r) can equal any 
natural number. Moreover, the connectedness assumption also implies that 
there is exactly one cornerpoint at infinity. 

As an example of cornerpoints in reduced size functions, in Fig. 2 we see that 
the proper cornerpoints are the points p, q and r (with multiplicity 2, 1 and 
1, respectively). The line m is the only cornerpoint at infinity. 

The importance of cornerpoints is revealed by the next result, analogous to 
Prop. 10 of [16], showing that cornerpoints, with their multiplicities, uniquely 
determine reduced size functions. 

The open (resp. closed) half-plane A"*" (resp. A+) extended by the points at 
infinity of the kind {k, 00) will be denoted by A* (resp. A*), i.e. 

A* := A+ U {{k, 00): ke M}, A* := A+ U {{k, 00) : k e K}. 

Theorem 8 (Representation Theorem) For every {x, y) e A+ we have 
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x<x,y>y 



(1) 



The equality (1) can be checked in the example of Fig. 2. The points where 
the reduced size function takes value are exactly those for which there is no 
cornerpoint (either proper or at infinity) lying to the left and above them. Let 
us take a point in the region of the domain where the reduced size function 
takes the value 3. According to the above theorem, the value of the reduced 
size function at that point must be equal to ii{m) + ii{p) — 3. 

Remark 9 By comparing Th. 8 and the analogous result stated in Prop. 10 
of [16], one can observe that the former is stated more straightforwardly. As a 
consequence of this simplification, all the statements in this paper that follow 
from Th. 8 arc less cumbersome than they would be if wc applied size functions 
instead of reduced size functions. This is the main motivation for introducing 
the notion of reduced size function. 

In order to make this paper self-contained, in the rest of this section we report 
all and only those results about size functions that will be needed for proving 
our statements in the next sections, re-stating them in terms of reduced size 
functions. Proofs are omitted, since they are completely analogous to those 
for classical size functions. 

The following result, expressing a relation between two reduced size functions 
corresponding to two spaces, A4 and J\f, that can be matched without changing 
the measuring functions more that h, is analogous to Th. 3.2 in [14]. 

Proposition 10 Let {Ai,ip) and [M^ip) he two size pairs. If f : M. M is 
a homeomorphism such that maxpgx |<^(P) — V'(/(P))| < h, then for every 
(x, y) e A+ we have 

^Im,<p){^ -h,y + h)< ilj^^^){x, y). 

The next proposition, analogous to Prop. 6 in [16], gives some constraints on 
the presence of discontinuity points for reduced size functions. 

Proposition 11 Let {M.,(fi) be a size pair. For every point p — {x,y) G 
a real number e > exists such that the open set 

We{p) := {{x, y) eM."^ : \x - x\ < e,\y - y\ < e,x x,y y} 

is contained in and does not contain any discontinuity point for l*j^ ^py 

The following analog of Prop. 8 and Cor. 4 in [16], stating that cornerpoints 
create discontinuity points spreading downwards and towards the right to A, 
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Fig. 3. A reduced size function with cornerpoints accumulating onto the diagonal: 
M. is the space depicted on the left and measures the height of each point. 

also holds for reduced size functions. 

Proposition 12 (Propagation of discontinuities) If p = {x,y) is a proper 
cornerpoint for then the following statements hold: 

i) If ^ ^ X < y, then y is a discontinuity point for ')/ 

ii) If X < y < y, then x is a discontinuity point for £*(^j^^^^{-,y). 

Iff= {x, oo) is the cornerpoint at infinity for then the following state- 

ment holds: 

iii) If X <y, then x is a discontinuity point for i*(^j^^^^{-,y). 

The position of cornerpoints in is related to the extrema of the measur- 
ing function as the next proposition states, immediately following from the 
definitions. 

Proposition 13 (Localization of cornerpoints) If p — {x,y) is a proper cor- 
nerpoint for £*jt^^^^, then 

p e {{x,y) e : min(^ <x<y< max(^}. 

If f — (x, oo) is the cornerpoint at infinity for £*M,ip)> then x — min (/?. 

Prop. 12 and Prop. 13 imply that the number of cornerpoints is either finite 
or countably infinite. In fact, the following result can be proved, analogous to 
Cor. 3 in [16]. 

Proposition 14 (Local finiteness of cornerpoints) For each strictly positive 
real number e, reduced size functions have, at most, a finite number of corner- 
points in {(x, y) e : X + e < y}. 
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Therefore, if the set of cornerpoints of a reduced size function has an accumu- 
lation point, it necessarily belongs to the diagonal A. An example of reduced 
size function with cornerpoints accumulating onto the diagonal is shown in 
Fig. 3. 

Moreover, this last proposition implies that in the summation of Th. 8 (Rep- 
resentation Theorem), only finitely many terms are different from zero. 



3 Matching distance 

In this section we define a matching distance between reduced size functions. 
The idea is to compare reduced size functions by measuring the cost of trans- 
porting the cornerpoints of one reduced size function to those of the other one, 
with the property that the longest of the transportations should be as short 
as possible. Since, in general, the number of cornerpoints of the two reduced 
size functions is different, we also enable the cornerpoints to be transported 
onto the points of A (in other words, we can "destroy" them). 

When the number of cornerpoints is finite, the matching distance may be re- 
lated to the bottleneck transportation problem (cf., e.g., [11], [20]). In our 
case, however, the number of cornerpoints may be countably infinite, because 
of our loose assumption on the measuring function, that is only required to 
be continuous. Nevertheless, we prove the existence of an optimal match- 
ing. Under more tight assumptions on the measuring function, the number 
of cornerpoints is ensured to be finite and a bottleneck distance can be more 
straightforwardly defined. For example, in [4] a bottleneck distance for com- 
paring Persistent Homology Groups is introduced under the assumption that 
the measuring functions are tame. We recall that a continuous function is 
tame if it has a finite number of homological critical values and the homology 
groups of the lower level sets it defines are finite dimensional. 

Although working with measuring functions that are continuous rather than 
tame involves working in an infinite dimensional space, yielding many technical 
difficulties in the proof of our results (for instance, compare our Matching 
Stability Theorem 25 with the analogous Bottleneck Stability Theorem for 
Persistence Diagrams in [4]), there are strong motivations for doing so. First 
of all, in real applications noise cannot be assumed to be tame, so that the 
perturbation of a tame function may happen to be not tame. In second place, 
when working in the more general setting of measuring functions with values 
in M.^ instead of M, it is important that the set of functions is closed under the 
action of the max operator, as shown in [2] , whereas the set of tame functions 
is not. Last but not least, working with continuous functions allows us to 
relate the matching distance to the natural pseudo-distance, which is our final 
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goal, without restricting the set of homeomorphism to those preserving the 
tameness property. 

Of course the matching distance is not the only metric between reduced size 
functions that we could think of. Other metrics for size functions have been 
considered in the past ([10], [13]). However, the matching distance is of partic- 
ular interest since, as we shall see, it allows for a connection with the natural 
pseudo-distance between size pairs, furnishing the best possible lower bound. 
Moreover, it has already been experimentally tested successfully in [1] and [3]. 

In order to introduce the matching distance between reduced size functions we 
need some new definitions. The following definition of representative sequence 
is introduced in order to manage the presence in a size function of infinitely 
many cornerpoints as well as that of their multiplicities. Moreover, it allows 
us to add to the set of cornerpoints a subset of points of the diagonal. 

Definition 15 (Representative sequence) Let i* be a reduced size function. 
We shall call representative sequence for i* any sequence of points a : N — > A*, 
(briefly denoted by (oj) ), with the following properties: 

(1) Gq is the cornerpoint at infinity for I* ; 

(2) For each i > either ai is a proper cornerpoint for i*, or ai belongs to 
A; 

(3) If p is a proper cornerpoint for i* with multiplicity n{p), then the cardi- 
nality of the set {i E N : at = p} is equal to ^{p); 

(4) The set of indexes for which Oj is in A is countably infinite. 

We now consider the following pseudo-distance d on A* in order to assign a 
cost to each deformation of reduced size functions: 

d ((x, y) , (x', y')) := min |max {|a; — x'\, \y — y'\} , max 

with the convention about oo that oo—y — y — oo — ooior y ^ oo, oo — oo = 0, 
Y — oo, |oo| = oo, min{oo, c} = c, max{oo, c} = oo. 

In other words, the pseudo-distance d between two points p and p' compares 
the cost of moving p to p' and the cost of moving p and p' onto the diagonal 
and takes the smaller. Costs are computed using the distance induced by the 
max-norm. In particular, the pseudo-distance d between two points p and p' 
on the diagonal is always 0; the pseudo-distance d between two points p and 
p', with p above the diagonal and p' on the diagonal, is equal to the distance, 
induced by the max-norm, between p and the diagonal. Points at infinity have 
a finite distance only to other points at infinity, and their distance depends 
on their abscissas. 

Therefore, d{p,p') can be considered a measure of the minimum of the costs 



y — X y — X 
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of moving p to p' along two different paths (i.e. the path that takes p directly 
to p' and the path that passes through A). This observation easily yields that 
d is actually a pseudo-distance. 

Remark 16 It is useful to observe what disks induced by the pseudo-distance 
d look like. For r > 0, the usual notation B(p,r) will denote the open disk 
{p' e A"*" : d{p,p') < r}. Thus, if p is a proper point with coordinates {x,y) 
and y — X >2r (that is, d{p, A) > r), then B{p, r) is the open square centered 
at p with sides of length 2r parallel to the axes. Whereas, if p has coordinates 
{x, y) with y — X < 2r (that is, d{p, A) < r), then B{p, r) is the union of the 
open square, centered at p, with sides of length 2r parallel to the axes, with 
the stripe {{x^y) eM? :{) <y — x < 2r}, intersected with A+(scc also Fig. 4). 
lip = (x, oo) is a point at infinity then B{p,r) = {{x', oo) G A* : |a; — a;'| < r}. 



y 




Fig. 4. Disks induced by the pseudo-metric d (shaded). Left: d{p,A) > r. Right: 
d{p, A) < r. 

In what follows, the notation Q{p,r) will refer to the open square centered 
at the proper point p, with sides of length 2r parallel to the axes (that is, 
the open disk centered at p with radius r, induced by the max- norm). Also, 
when r > 0, Q{p,r) will refer to the closure of Q{p,r) in the usual Euclidean 
topology, while Qip,0) := {p}. 

Definition 17 (Matching distance) Let l\ and £2 be two reduced size func- 
tions. If{ai) and {hi) are two representative sequences for i\ and £2 respectively, 
then the matching distance between t{ and £3 is the number 

dmatch{£*i, Q ■= inf sup d{ai, ba{i)), 

where i varies in N and a varies among all the bijections from N to N. 

In order to illustrate this definition, let us consider Fig. 5. Given two curves, 
their reduced size functions with respect to the measuring function distance 
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from the center of the image are calculated. One sees that the top reduced 
size function has many cornerpoints close to the diagonal in addition to the 
cornerpoints r, a, b, c, d, e. Analogously, the bottom reduced size function 
has many cornerpoints close to the diagonal in addition to the cornerpoints 
r', a', b', d . Cornerpoints close to the diagonal are generated by noise and 
discretization. The superimposition of the two reduced size functions shows 
that an optimal matching is given by r — r', a — > a', 6 — 6', c — > c', c? — > A, 
e — *■ A, and all the other cornerpoints sent to A. Sending cornerpoints to 
points of A corresponds to the annihilation of cornerpoints. Since the matching 
c — > c' is the one that achieves the maximum cost in the max-norm, the 
matching distance is equal to the distance between c and d (with respect to 
the max-norm). 




Fig. 5. Left: Two curves. Center: Their reduced size functions with respect to the 
measuring function distance from the center of the image. Right: The superimposi- 
tion of the two reduced size functions. 

Proposition 18 dmatch is a distance between reduced size functions. 



PROOF. It is easy to see that this definition is independent from the choice 
of the representative sequences of points for il and £2- fact, if (oj) and (di) 
are representative sequences for the same reduced size function £*, a bijection 
(J : N — > N exists such that d{di, a^(^i^) = for every index i. 

Furthermore, we have that dmatch{.£\M (/?)' ^(V v)) ^ +00, for any two size pairs 
(A^,(y9) and {Af,ip)- Indeed, for any bijection cr : N — N such that a{0) = 0, 
it holds supi d{ai,bo-{i)) < +00, because of Prop. 13 (Localization of corner- 
points) . 
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Finally, by recalling that reduced size functions arc uniquely determined by 
their cornerpoints with multiplicities (Representation Theorem) and by us- 
ing Prop. 14 (Local finiteness of cornerpoints), one can easily see that dmatch 
verifies all the properties of a distance. □ 

We will show in Th. 28 that the inf and the sup in the definition of matching 
distance are actually attained, that is dmatchi^-ii (-2) = niino- maxj (i(aj, 6o-(j)). In 
other words, an optimal matching always exists. 



4 Stability of the matching distance 

In this section we shall prove that if 9? and ip are two measuring functions on }A 
whose difference on the points of Ai is controlled by e (namely maxpg^i \lp{P) — 
'0(-P)| < e), then the matching distance between and is also con- 

trolled by e (namely dmatch{^lM,<pV ^Im,^i,)) ^ 

For the sake of clarity, we will now give a sketch of the proof that will lead 
to this result, stated in Th. 25. We begin by proving that each cornerpoint of 
^*{M <p) with multiplicity m admits a small neighborhood, where we find exactly 
m cornerpoints (counted with multiplicities) for ^(^^j .^), provided that on Ai 
the functions ip and ip take close enough values (Prop. 20). Next, this local 
result is extended to a global result by considering the convex combination 
— li/^ + ^<p oi if and ■0- Following the paths traced by the cornerpoints 
of as t varies in [0, e], in Prop. 21 we show that, along these paths, 

the displacement of the cornerpoints is not greater than e (displacements are 
measured using the distance d, and cornerpoints are counted with their mul- 
tiplicities). Thus we are able to construct an injection /, from the set of the 
cornerpoints of £Im,<p) cornerpoints of C*(^M,i)) (extended to a 

countable subset of the diagonal), that moves points less than e (Prop. 23). 
Repeating the same argument backwards, we construct an injection g from 
the set of the cornerpoints of ^\m,'^) '-'^ cornerpoints of ('\m,(p) 

(extended to a countable subset of the diagonal) that moves points less than 
e. By using the Cantor-Bernstein theorem, we prove that there exists a bijec- 
tion from the set of the cornerpoints of ^-^ to the set of the cornerpoints 
of ^-j (both the sets extended to countable subsets of the diagonal) that 
moves points less than e. This will be sufficient to conclude the proof. Once 
again, we recall that in the proof we have just outlined, cornerpoints are always 
counted with their multiplicities. 

We first prove that the number of proper cornerpoints contained in a suf- 
ficiently small square can be computed in terms of jumps of reduced size 
functions. 
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Proposition 19 Let {Ai, (p) be a size pair. Let p = {x, y) G and letrjyO 

be such that x + r] < y — rj. Also let a = {x + r),y — rj), h — {x — r),y — rf), 
c = (x + 77, y + r/), e = {x — r],y + rj). Then 

is equal to the number of (proper) cornerpoints for counted with their 

multiplicities, contained in the semi-open square Qjj, with vertices at a, b, c, e, 
given by 

Qv — {(^' y) e : X -ri < X < x + Tj, y-r]<y<y + r]}. 



PROOF. It easily follows from the Representation Theorem (Th. 8). □ 



We now show that, locally, small changes in the measuring functions produce 
small displacements of the existing proper cornerpoints and create no new 
cornerpoints. 

Proposition 20 (Local constancy of multiplicity) Let {Ai,Lp) be a size pair 
and let p = {x,y) be a point in with multiplicity ii{p) for (-\M,ip) (pos- 
sibly fj.{p) = 0). Then there is a real number fj > such that, for any real 
number rj with < rj < fj, and for any measuring function ip : Ai ^ M. with 
maxpg_A/i \<f{P) — ^ Vj 'the reduced size function has exactly fj,{p) 

(proper) cornerpoints (counted with their multiplicities) in the closed square 
Q{p,ri), centered at p with side 2f]. 



PROOF. By Prop. 11, a sufficiently small real number e > exists such that 
the set 

Weip) = {{x,y) eR^ : \x - x\ < €,\y - y\ < €,x x,y y} 

is contained in A+ (i.e. x-\-e <y — e), and does not contain any discontinuity 
point for Prop. 12 (Propagation of discontinuities) implies that p is the 

only cornerpoint in Q{p, e). 

Let fj be any real number such that Q < fj < e/2. For each real number 77 with 
< 77 < 77, let us take a sufficiently small positive real number 5 with S < rj 
and 277 + 5 < e, so that x -\- 2r] -\- 6 < y — 2r] — 5. 

We define a = {x+rj+S, y—ri—5), b = {x—rj—d, y—r)—5), c = (a;+r7+5, y+ri+6), 
e = {x - 7] - 6,y + 7] + 6). 
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^2rj + S,y -2ri - 5) 



Fig. 6. The sets We{p) and Qjj+s used in the proof of Prop. 20. 

If : — > R is a measuring function such that maxpg^ \ f{P) ~ ^ 
then by applying Prop. 10 twice, 

Since is constant in each connected component of W^{p), we have that 

^*{M,v)i^ + S,y-5) = £(^,^)(a) = i*M,,p)ix + 2r) + 5,y -2r) - 5), 

implying ^(^,^)(a) = ^(^,^)(a). Analogously, ^(^,^)(&) = ^(>t,^)(c) = 

^(M,^)ic), i{M,^){e) = ^(x,^)(e). Hence, i.e. the multiplicity of p for t^M,^)^ 
equals 

By Prop. 19, we obtain that fx{p) is equal to the number of cornerpoints for 
•^(A4,V') contained in the semi-open square with vertices a, b, c, e given by 

Qri+s = {{x, y) e A"*" : x — -q — 5 < x < x -\- -q + 5,y — -q — 5 < y < y -\- -q + 5} . 

This is true for any sufficiently small 5 > 0. Therefore, is equal to the 
number of cornerpoints for ^(_a4 ^) contained in the intersection ni>oQr/+(5- It 
follows that ii{p) amounts to the number of cornerpoints for ^M,tp) contained 
in the closed square Q{p,ri). □ 

The following result states that if two measuring functions ip and ip differ less 
than e in the Loo-norm, then it is possible to match some finite sets of proper 
cornerpoints of ^lM,tp) proper cornerpoints of ^lM,tp)y with a motion smaller 
than e. 

Proposition 21 Let e > be a real number and let {A4,ip) and {M.,ip) be 
two size pairs such that maxp^^^ \(p{P) — < Then, for any finite 
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set K of proper cornerpoints for ^*m,^) ^^^^ d{K, A) > e, there exist (ai) 
and representative sequences for (■\m,<p) ^'^^ ^\m,iIi) f^spectively, such that 
d{ai, hi) < e for each i with e K. 

PROOF. The claim is trivial for e = 0, so let us assume e > 0. 

Let $t = ^-0 + with t e [0,e]. Then, for every t,t' e [0, e], we have 
maxpe^ \^t{P) -\'{P)\ <\t- t'\. 

Let K = {pi, . . . ,pk}, let rrij be the multiplicity of pj, for j = 1, . . . , k, and 
m — J2j=i f^j- Then we can easily construct a representative sequence of points 

{ai) for ^\m,<pY ^^^^ 

— Pl-i ■ ■ ■ ■> 0"mi — Pl-i O'mi+1 — P2, ■ ■ ■ , 0'mi+m2 — P2, ■ ■ ■ , 
^mi+...+mk-i — Pk—lj ^mi+...+mk-1+l — Pkj ■ ■ ■ j — Pk- 

Now we will consider the set A defined as 

{S e [0, e] : 3(af ) representative sequence for s.t. d{ai, af) < S, \/ai e K}. 

In other words, if we think of the variation of t as the flow of time, A is the 
set of instants 5 for which the cornerpoints in K move less than 5 itself, when 
the homotopy is applied to the measuring function (/?. 

A is non-empty, since e A. Let us set 5 = sup A and show that 6 E A. Indeed, 
let {6n) be a sequence of numbers of A converging to 5. Since 5„ e A, for each 
n there is a representative sequence (of") for £*m,^s ) with d{ai, af") < 5n, for 
each i such that a, e K. Since Sn < e, o?(ai,af") < e for any i and any n. 
Thus, recalling that d{K, A) > e, for each i such that G i^, it holds that 
af"- G e) for any n. Hence, for each i with a, G -fT, possibly by extracting 
a convergent subsequence, we can define af = lim„ af". We have d{ai, af) < S. 
Moreover, by Prop. 20 (Local constancy of multiplicity), af is a cornerpoint 
for Also, if r indexes ji, ■ ■ ■ ,jr exist, such that aj^, . . . , aj^ G K and 

a^^ = • • • = a^j^ —: q, then the multiplicity of q for -^(^^ $_) is not smaller 
than r. Indeed, since 5n — > ^, for each arbitrarily small ri > and for any 
sufficiently great n, the square Q{q,ri) contains at least r cornerpoints for 
3>i )' counted with their multiplicities. But Prop. 20 implies that, for each 
sufficiently small rj, Q{q, rf) contains exactly as many cornerpoints for ) 
as the multiplicity of q with respect to ^(^v^ if — 5| <r]. Therefore, the 
multiplicity of q for ^*;v(,<i>^) greater than, or equal to, r. 

The previous reasoning allows us to claim that if a cornerpoint q occurs r 
times in the sequence (af , . . . , a^), then the multiplicity of q for ^(^^$-) is at 
least r. 
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In order to conclude that S&A,it is now sufficient to observe that (af , . . . , a^) 
is easily extensible to a representative sequence for ^_-) , simply by setting 
ao equal to the corncrpoint at infinity of i'(^vi#^)) ^'^d by continuing the se- 
quence with the remaining proper cornerpoints of and with a countable 
collection of points of A. So we have proved that sup A is attained in A. 

We end the proof by showing that max A = e. In fact, if 5 < e, by us- 
ing Prop. 20 once again, it is not difficult to show that there exists 77 > 0, 
with S + ri < e, and a representative sequence (af'*''') for ), such 

that d(af, a^^^) < rj for 1 < i < m. Hence, by the triangular inequality, 
d{ai, a^i^^) < 6 + 7] ior 1 < i < m, implying that 6 + r] G A. This would 
contradict the fact that S — max A. Therefore, e = max A, and so e e A. □ 

We now give a result stating that if two measuring functions (p and ip differ 
less than e in the L^o-norm, then the cornerpoints at infinity have a distance 
smaller than e. 

Proposition 22 Let e > be a real number and let {Ai,Lp) and {Ai,ip) be 
two size pairs such that maxpg_A4 \^p{P) — 'ip{P)\ < Then, for each (oj) and 
(bi) representative sequences for ilM,<p) ^"'^ ^Im,iP)' respectively, it holds that 
d{ao,bo) < e. 



PROOF. By Prop. 13 (Localization of cornerpoints), d{ao,bo) = |min(/9 — 
min-?/^|. Let mimp = ^{Ptp) and miu'^ = ip{P^), with P^,P^ G M.. Since 
maxp£_A4 \^{P) ~ '4'{P)\ ^ then 

^^J{P^) > <p{P^) - e, <p{P^) > tl^iP^) - e. 

By contradiction, let us assume that d{aQ, bo) > e, that is, \ip{Py^)—ip{P^)\ > e. 
So either ^^(P,^) < i'iPip) — e or ijjiP^) < v{P^) — e- In the first case, 

'il^{P^)-e<ip{P^)<'il;{P^)-e, 

in the latter case, 

^{P^)-e<^{P^)<^{P^)-e. 

Hence we would conclude that either P^ is not a minimum point for or P^p 
is not a minimum point for (/?. In both cases we get a contradiction. □ 

Now we prove that it is possible to injectively match all the cornerpoints of 
^) those of tf^j^ with a maximum motion not greater than the Loo- 
distance between </? and -0. 
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Proposition 23 Let e > be a real number and let {M, ip) and {M.,tp) be 
two size pairs such that maxpg_A4 \^p{P) — 'ip{P)\ < e- Then there exist (oj) 
and ipi) representative sequences for and i^j^ respectively, and an 

injection / : N — > N such that d{ai, hf(i)) < e. 



PROOF. The claim is trivial if e = 0, so let us assume e > 0. 

Let H be the set of all the cornerpoints (both proper and at infinity) for 
(-Im,^)- write H ^ Kili K2, where Ki := {p e H : d{p, A) > e} and 

K2 :— {p & H : d{p, A) < e}. The cardinahty of Ki is finite, according to 
Prop. 14 (Local finitcncss of cornerpoints). Therefore, Prop. 21 and Prop. 22 
imply that there exist (a^) and (hi) representative sequences of points for ^-^ 
and t(^j^ respectively, such that (i(aj, 6j) < e for each Oj e Ki (it follows that 
if tti e K\ then, necessarily, bi ^ A). 

Now let us write N as the disjoint union of the sets Ji and I2, where we set 
i G /i, if flj G -ft^i, and i G I2, if G U A. We observe that, by the definition 
of a representative sequence, there is a countably infinite collection of indices 
j with bj contained in A. Thus, there is an injection P : I2 ^ N such that 
bf3(i) G A. 

We define / by setting f{i) = i for i E Ii and f{i) = j3{i) for i G /2. By 
construction, / is injective and d{ai, bf(i)) < e for every i. □ 



We recall the well-known Cantor-Bernstein theorem (cf. [23]), which will be 
useful later. 

Theorem 24 (Cantor-Bernstein Theorem) Let A and B be two sets. If two 
injections f : A ^ B and g : B ^ A exist, then there is a bijection I : A ^ B. 
Furthermore, we can assume that the equality l{a) — b implies that either 
f{a) — b or g{b) — a (or both). 

We are now ready to prove a key result of this paper. We shall use this result in 
the next section in order to prove that the matching distance between reduced 
size functions furnishes a lower bound for the natural pseudo-distance between 
size pairs. Nevertheless, this result if meaningful by itself, in that it guarantees 
the computational stability of the matching distance between reduced size 
functions. 

Theorem 25 (Matching Stability Theorem) Let {M.^ip) he a size pair. For 
every real number e > and for every measuring function : — > M, such 
that maxpg^ \v{P) ~ V'(-P)| ^ ^he matching distance between (■\m,ip) ^'^^ 
(■\M,i)) smaller than or equal to e. 
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PROOF. Prop. 23 implies that there exist {ai) and representative se- 
quences for and ^-^ respectively, and an injection / : N ^ N such 
that d{ai,bf(i)) < e. Analogously, there exist (aQ and (6^) representative se- 
quences for and £*^j^ respectively, and there is an injection g^' : N — > N, 
such that d{b[, a'g,^^^) < e for every index i. Hence another injection gr : N — > N 
exists, such that d{bi, ag(^i)) < e for every index i. Then the claim follows from 
the Cantor-Bernstein Theorem, by setting A — B — N. □ 



5 The connection between the matching distance and the natural 
pseudo-distance 



We recall that, given two size pairs (7W, ip) and (A/", ip), with A4 and Af homeo- 
morphic, a measure of their shape dissimilarity is given by the natural pseudo- 
distance. 

As a corollary of the Matching Stability Theorem (Th. 25) we obtain the 
following Th. 29, stating that the matching distance between reduced size 
functions furnishes a lower bound for the natural pseudo-distance between 
size pairs. 

Definition 26 The natural pseudo-distance between two size pairs {A4,ip) 
and (jV, ip) with Ai and H homeomorphic is the number 

infmax|<^(P)-V^(M^))|, 

where h varies in the set H{M.,J\f) of all the homeomorphisms between Ai 
and H . 

We point out that the natural pseudo-distance is not a distance because it 
can vanish on two non-equal size pairs. However, it is symmetric, satisfies the 
triangular inequality, and vanishes on two equal size pairs. 

Remark 27 We point out that an alternative definition of the dissimilarity 
measure between size pairs, based on the integral of the change of the mea- 
suring functions, rather than on the max, may present some drawbacks. 

For example, let us consider the following size pairs (A^,(/9), (A/", '?/'), (A/", x)? 
where A4 is a circle of radius 2, A/" is a circle of radius 1, and the measuring 
functions are constant functions given by(/?=l,V' = l,X = 2. Let ^l and u 
respectively denote the 1-dimensional measures induced by the usual embed- 
dings of M. and M in the Euclidean plane. By setting, for any homeomorphism 

e(/)= / \ip-^l^of\d|l, 

M 
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we have ©(/) = 0. For any homeomorphism g : H ^ Af, we have 

^{9) ^ j IV' - X o ^1 c^i^ = 27r. 

On the other hand, 

M 

Hence, the inequahty Q{gof) < Q{f)+Q{g) does not hold. This fact prevents 
the function mif^H{M,Af) ©(/) from being a pseudo-distance, since we do not 
get the triangular inequality. Furthermore this function is not symmetric for 
all couples of size pairs. 

Other dissimilarity measures based on some integral of the change of the 
measuring functions are object of a on-going research. Preliminary results 
on this subject can be found in [17]. 

For more details about natural pseudo-distances between size pairs, the reader 
is referred to [7], [8] and [9]. 

Before stating the main result of this section (Th. 29), in Th. 28 we show 
that the inf and the sup in the definition of matching distance are actu- 
ally attained, that is to say, a matching a exists for which dmatchi^i, ^2) — 
minCTmaxjd(aj, 6^(1) )■ Every such matching will henceforth be called optimal. 

Theorem 28 (Optimal Matching Theorem) Let (oj) and (bi) be two repre- 
sentative sequences of points for the reduced size functions l\ and £2 respec- 
tively. Then the matching distance between and £2 is equal to the number 
miuo- maxj d{ai, ba{i)), where i varies in N and a varies among all the bisections 
from N to N. 



PROOF. By the definition of the pseudo-distance d, we can confine ourselves 
to considering only the bijections a, such that (t(0) = 0. In this way, by 
Prop. 13 (Localization of cornerpoints) , we obtain that sup jgj^ d(ai, 6^(1)) < 
-|-oo. 

Let us first see that supjgj^ c?(ai, &(T(i)) = maxjgjs} d(aj, 6o-(i)); for any such bijec- 
tion (J. This is true because proper cornerpoints belong to a bounded set, and 
the accumulation points for the set of cornerpoints of a reduced size function 
(if any) cannot belong to A+, but only to A (see Prop. 14, Local finiteness of 
cornerpoints). By definition, for any p and p' in A, d{p,p') = 0, and hence the 
claim is proved. 
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Fig. 7. The sets a(Ji) and a(J2) of cornerpoints corresponding to the sets Ji and 
J2 used in the proof of Th. 28. The sequence (oj) is represented by dots and the 
sequence (6^) by crosses. 

Let us now prove that info- maxj (/(oj, — miiicr maxj (/(oj, Let us set 

s := inf^ maxj (i(aj, According to Prop. 14, if s = 0, then the corner- 

points of il coincide with those of i^, and their multiphcities are the same, 
implying that the claim is true. Let us consider the case when s > 0. Let 
Ji := {i eN : d{ai,A) > s} and J2 := {i e N : d{ai,A) < s} (see Fig. 7). 
By Prop. 14, Ji contains only a finite number of elements, and therefore there 
exists a real positive number e for which d{ai, A) > s + e for each i e Ji. 

Let us consider the set E of all the injective functions cr : Ji — > N such that 
maxjgj^ d{ai, &(T(i)) < s + e/2. This set is non-empty by the definition of s, and 
contains only a finite number of injections because Ji is finite, and for each 
i e Ji the set {j G N : d{ai,bj) < s + e/2} is finite. Thus we can take an 
injection : Ji — > N that realizes the minimum of max^^j^ (i(aj, 6^(1)) as a 
varies in E. Obviously, maxj^jj d{ai, ba{i)) < s, by the definition of s. 

Moreover, we can take an injection a : J2 — >■ N such that maxjgjj d{ai, b^(i)) < 
s, because, for every i e J2, we can choose a different index j, such that 
bj e A. Since N = JiU J2 and Im{a)nlm{a) — 0, we can construct an injection 
/ : N ^ N such that each displacement between and is not greater than 
s, by setting f{i) = a{i) for i G Ji, and f{i) = a{i) for i G J2- Analogously, we 
can construct an injection : N ^ N such that each displacement between bi 
and Qgii) is not greater than s. Therefore, by the Cantor-Bernstein Theorem, 
there is a bijection Z : N — > N such that d{ai, bui)) < s for every index i, and 
so the theorem is proved. □ 

Theorem 29 Let e > be a real number and let {A4,ip) and {J\f,ijj) be two 
size pairs with M. and Af homeomorphic. Then 

dmatch{t^M,<p)^^lx,^l,)) < inf max \^{P) - ip{h{P))\, 
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where h ranges among all the homeomorphisms from Ai to H . 



PROOF. We begin by observing that = ^{M,ipoh)^ where h : M ^ X is 

any homeomorphism between M. and jv . Moreover, for each homeomorphism 
h, by applying Th. 25 with e = maxpg^ \ f{P) ~ we have 

dmatch{i(M,^),ilM,^oh)) < ^a^ |(^(P) - V^(/l(P)) | . 

Since this is true for any homeomorphism h between M. and jV, it immediately 
follows that dmatch{£lM,^)'^U,^)) ^ mf maxpg^ \(p{P) - □ 



6 Construction of size pairs with given reduced size functions and 
natural pseudo-distance 

The following lemma states that it is always possible to construct two size pairs 
such that their reduced size functions are assigned in advance, and their nat- 
ural pseudo-distance equals the matching distance between the corresponding 
reduced size functions. 

This result evidently allows us to deduce that the lower bound for the natural 
pseudo-distance given by the matching distance is sharp. Furthermore, in the 
next section, we will exploit this lemma to conclude that, although other 
distances between reduced size functions could in principle be thought of in 
order to obtain a lower bound for the natural pseudo-distance, in practice it 
would be useless since the matching distance furnishes the best estimate. 

The proof of this lemma is rather technical, so we anticipate the underlying 
idea. We aim at constructing a rectangle R and two measuring functions (f 
and T/i on R, such that = q^^^^ and = for given (M,ip) 

and (Afjip). To this aim, we fix an optimal matching between cornerpoints of 
^M,ip) i^j^^^) that achieves dmatch{t^^M,<pY^lN,i>))- We begin by defining (p 
and ip as linear functions on i?, with minima respectively at height min if and 
min-?/'. Thus, the corresponding reduced size functions have cornerpoints at 
infinity coinciding with those of and t^^j^ respectively, and no proper 

cornerpoints. Next, for each proper cornerpoint {x,y) for ^(^^), considered 
with its multiplicity, we modify the graph of (p by digging a pit with bottom 
at height x and top at height y (see Fig. 10). This creates a proper corner- 
point for d-^n^) at {x,y). Of course, we take care that this occurs at different 

points of R for different cornerpoints of ^R^y Now we modify the graph of ip 
exactly at the same point of R as for (p, as follows. If the cornerpoint {x, y) for 
9?) matched to a cornerpoint (x', y') of ^j^^y we dig a pit in the graph 
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of ijj with bottom at height x' and top at height y'. This creates a proper 
cornerpoint for ^(^^-j at {x',y'). Otherwise, if the cornerpoint {x,y) for ^*Mip) 
is matched to a point of the diagonal, we construct a plateau in the graph 
of at height This does not introduce new cornerpoints for ^*(^j^^y We 
repeat the procedure backwards, for cornerpoints of ^^^^ are matched 

to points of the diagonal in i*M,^)- Although this is the idea underlying the 
construction of the measuring functions, in our proof of Lemma 30 we shall 
confine ourselves to giving only their final definition. 

Now, in order to show that dmatchi^ln is equal to the natural pseudo- 

distance between (i?, (p) and (i?, ^), we consider the identity function on R. We 
measure the difference between (p and at each point of R. By construction, 
this difference is greater at pits and plateaux than elsewhere, that is to say, 
corresponds to the matching of cornerpoints. Hence, the identity homeomor- 
phism of R achieves a change in the measuring functions (p and ip equal to the 
matching distance between ^\r^^) and ^^^^y This imphes that the matching 

distance is equal to the natural pseudo-distance between {R,ip) and {R,ip), 
and the identity function on R is exactly the homeomorphism attaining the 
natural pseudo-distance. 

The last step of the proof is to enlarge the rectangle i? to a topological 2-sphere 
and to extend the measuring functions to this surface, without modifying the 
corresponding reduced size functions and the natural pseudo-distance. 

Lemma 30 Let i'^j^ and be two reduced size functions. There always 

exist two size pairs {A4',ip') and {Ad'jip'), with M.' homeomorphic to a 2- 
sphere, such that the following statements hold: 



(1) 
(2) 



(3) dmatch{^lM',v^')^^M','^')) = inf ft maxpgyn/ |(/7'(P) - il;'{h{P))\, h sweeping 
the set H{M.',A4') of all self-homeomorphisms of A4', that is to say the 
matching distance equals the natural pseudo-distance; 

(4) inf/i maxpg_yvi' Iv^'(-P) — "^'(^(-P)! = maxpg_A4' \ ~ "^'(-P)!; that is to 
say, the identity homeomorphism attains the natural pseudo- distance. 



PROOF. Possibly by swapping the size pairs, we can assume that min (p < 
min'0. 

Let (oj) and be two representative sequences for and ^j^^y respec- 

tively. It is not restrictive to assume that the identical bijection i i— > i is an 
optimal matching such that dmatch{^*M,if)T ^(Af,-)!})) attained. We define 

7 := {i e N - {0} : a, ^ A}, J:={jeN- {0} : b, ^ A}. 
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Fig. 8. Case i £ I. Left: The graph of the function ip{^,y). Right: The graph of the 
functions (^(3^,2/). 

For every z > 0, let = (xj, i/i) and bi = (x^, y^). Then 

lim {yi - Xi) = 0, lim {y- - x'^) = 0, 

i—>oo t^oo 

because of Prop. 14 (Local finiteness of cornerpoints). 

We choose a real number S with S > max{maxy9, max?/'}. Necessarily S > 
mimp. Moreover, we define (ej)jgN-{o} to be a sequence of positive real number 
tending to and such that, for every i G N — {0}, it holds that: 



Xi + yi 



Xi + yi 



2 2 
Finally, let R := [0, 1] x [miny?, S] C R"^ 



+ ei 



C (miny9, S). 



We define (f : R -^Rhy the following conditions: 

(i) Let i E I and = {xi,yi). Then (p[^^,y) is the piecewise linear function in 
the variable y G [min(y9, S] whose graph, consisting of four segments, is repre- 
sented in Fig. 8(left). Furthermore, (^(3^,1/) and (p{^:^,y) are the piecewise 
linear functions in the variable y G [min(y9, S*], whose graph (the same) consists 
of three segments, as represented in Fig. 8(right). 



(ii) Let i G J— I and hi = (a;-, y'^). Then (p{-^^,y) and (p{-^:^, y) are the piecewise 
linear functions in the variable y G [mine/?, S] whose graph, consisting of three 
segments, is represented in Fig. 9. Let us observe that > min y?, since we 
are assuming minyj < min^. 



(iii) ^9(0, y) and </)(!, y), as functions in the variable y G [min ip, S], are defined 
as the identity function y y. 



(iv) For any x G [0, 1] where not already defined, (p{x, y) is defined by linear 
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+ 



S 



Fig. 9. Case i £ J — I. The graph of the functions Lp{^,y) and <p{-^j^,y). 




Fig. 10. A "pit" of the function if : R 
extension with respect to the variable x. 



The aspect of the graph of ip in correspondence of a cornerpoint is represented 
in Fig. 10. 

Let us observe that, by construction, ip{x,y) is continuous everywhere in R, 
except possibly at x = 0. Actually, ip{x, y) is continuous also at x = 0. Indeed, 
when i tends to infinity, ^ and — Xj) 0. Thus the functions (p{\, y) 
with i > tend to the identity function in the sup norm. 

It is not difficult to check that = t^j^ . 

Let us now analogously define the function ip : R M., where R is still defined 
as [0, 1] X [min(y9, S] C M^, by means of the following conditions: 

(i') Let i G J and 6, = (x^,y^). Then ■?/'(^,?/) is the piecewise linear function 
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Fig. 11. Case i G J. Left: The graph of the function 'il>{^^y). Right: The graph of 
the functions ottt, y). 




s 



min^/; 



9- 



+ 

s 

+ <M 



s 



Fig. 12. Case i & I — J. Left: The graph of the functions ip{-^,y) and -0(3^,?/), 
when ^*~^y* > min^/^. Right: The graph of the functions ip{-^,y) and V'(3^k,y), 
when < min^. 

in the variable y G [min ip, S] whose graph, constituted of four segments, is 
represented in Fig. ll(left). Furthermore, ij{^^-,y) and '4'{^:^iy) ^-^e the 
piecewise linear functions in the variable y G [min ^p, S] whose graph (the 
same), constituted of three segments, is represented in Fig. 11 (right). 



(ii') Let i E / — J and Oj = {xi,yi). Then, if > minip, we take '?/'(^, y) and 
^^siST'^) piecewise linear functions in the variable y G [min(y9, S'], 

whose graph, constituted of three segments, is represented in Fig. 12(left). 
Otherwise, if < min-^, we take '?/'(^, ?/) and 'ipi-^;^, y) to be the piecewise 
linear functions in the variable y G [min y9, S] whose graph, constituted of two 
segments, is represented in Fig. 12 (right), 

(iii') TpiQ^y) and ip{l,y), as functions in the variable y G [miny?, 5*], are both 
defined as the function y ^ glm'ny {v ~ mincy?) + min-?/', that is to say, the 
linear function that takes value min-?/', when y = min 99, and value S, when 

y = s. 
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(iv') For any x G [0, 1], where not already defined, ip{x,y) is defined by linear 
extension with respect to the variable x. 



Again, one sees that -0 : i? — > R is continuous and 



Let us now show that d^natchit^R^^y t^^^^^) = maxpgi^ |v9(P) - 'ipiP)\- It may 
be useful to recall that dmatch{PlR,^)A\R^^)) = maxjeN where, if a, = 



Xi,yi) and k = (x^. 


y'i), 




if i = 


d{ai, hi) 


— min ijj — min (p; 


if i e 7 n J, 


d{ai, hi) 


^max{\y'i-yi\,\x'i 


if i e / - J, 


d{ai,bi) 


Vi . 

2 ' 


if i e J - /, 


d{ai,bi) 


y'-x'. 
2 ' 


if i ^ / U JU{0}, 


d{ai, hi) 


= 0. 



As for maxpgp \ f{P) ~ '4'{P)\j notice that the maximum must be attained at 
a point {x, y) G R, where either x = ^ or a; = g^^, with i > 0, or x = 0, or 

X — 1. Indeed, (p and ip are linearly defined in the variable x elsewhere. Let 
us consider the various instances separately. 

For X = and a; = 1, it holds that 

max \(f{x,y) — ip{x,y)\ — min'^ — mine/? = d{ao, ho). 

ye[mm<p,S] 



For x = ^ or X = g^^, different cases are possible. 

Case i ^ I D J . By looking at the graphs of ip{^,y) when i E I and ip{^,y) 
when i e J, we immediately see that 



max 

y£[mmip,S] 



max{ • ~ yi\,\x[ — Xi\, min ijj — min (p} 
max{(i(aj, hi), d{ao, bo)}. 



By looking at the graphs of (f{-^^,y) when i & I and ip{-^:^,y) when i & J, 
we see that 



max 

ye[min(p,S] 



max{ li/j' — yi\, min ijj — min (/?} < 
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< max{d(ai, bi), d{ao, bo)}. 



Case i e I - J. If > mmip, then 



max 

ye[rain<p,S] 

— max < 



Xi 



Xi + Vi 



, min ijj — min (/?| 



1 1 1 ; 1 X • 



Z/i Xi 



, min — min (/jj = 
= max{d(aj, b^), d{ao, bo)}. 

Or else, if < min^, then 

lUi — min -01 < \xi — min'^j, 

because xi < yi < 2min'0 — and hence x^ — min'^ < min'^ < min'^ — 
Moreover, |xj — min'^j = min'^ — Xi < min'^ — mine/?. Therefore, 



max 

2/e[min<^,5] 



max{ li/j — min '?/;| , \xi — min ■^j , min ^0 — min </?} 
min -0 — min (p = d{ao, bo). 



Analogously, if ^^^^ > min0, then 



max 



3i±l 



3i± 1 



max{ci(aj, k), d{ao, bo)}, 



or, if < min^, then 



max 

ye[minip,S] 



3i± 1 



3i± 1' 



d{ao,bo). 



Case i e J — I. We have that 



max 

ye [min 1^,5] 



= max • 



3i 

/ x'i + y'i 



max • 



yi 



yi-^i 



/ a;^ + y'i 



X, 



, min ijj — min </? / = 



, min jjj — min </? / = 



= max{(i(aj, 6,), (i(ao, 6o)}, 
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and 



max 

ye[min(p,S] 



max{d{ai,bi),d{ao,bo)}. 



From all these facts we deduce that 

m|X \ip{P) - '0(P)| = dmatch{ilR,^),ilR^^^). 

To complete the proof, it is now sufficient to extend the above arguments to 
a topological 2-sphere . To do this, let us take 

M' := d{R X [0,1]) = a([0,l] X [mm(p,S] x [0,1]). 

Obviously, A4' is homeomorphic to a 2-sphere. 

Moreover, by identifying R x {0} with R, let us define : Ai' — > M as 

follows: <^|Rx{0} = 'f^ <^|[0,l]x{mmv.}x[0,l] = ^in <^|[o,l]x{5}x[0,l] = ' 

us define </?' by linear extension in y e [min (p, S] elsewhere. Finally, let us 
define ij;' : M' R analogously: V^f^xio} = V^, V'ip.ilximinv.lxio,!] = ^inV, 
'/'|[o,i]x{5}x[o,i] = define ip' by linear extension in y e [min (^,5"] 

elsewhere. This completes the proof. □ 



Remcirk 31 It is worth noting that in our construction, starting from a 2- 
manifold, the requirement cp' and ■0' of class C° cannot be improved to C^. 
Indeed, it is possible to construct examples of reduced size functions ^*M,<p) 
such that each point of the set 

X := {ix,x) e A : < X < 1} 

is the limit point for a sequence of cornerpoints of If it were possible 

to construct ip' of class such that = the coordinates of each 

point in X would be the limit point for a sequence of critical values of (p' 
(cf. Cor. 2.3 in [12]). Since the set of critical values is closed, it would follow 
that any point in [0, 1] is a critical value for p'. This would contradict the 
Morse-Sard Theorem stating that, for a C^' map / from an m-manifold to 
an n-manifold, if r > max{0, m — n} then the set of critical values of / has 
measure zero in the co-domain (cf. [22], p. 69). We do not know whether it 
is possible to prove Lemma 30 with p' and ip' of class on a 2-manifold. 
Similarly, the Morse-Sard Theorem implies that it is not possible to construct 
size pairs satisfying the properties of Lemma 30, with p' and ip' of class 
on a 1-manifold, but we cannot exclude that this could be done by means of 
C° functions. 
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7 Compcirison with eeirlier results 



This section aims to show that dmatch is the most suitable metric to compare 
reduced size functions, mainly for two reasons. In the first place, wc prove 
that one cannot find a distance between reduced size functions giving a better 
lower bound for the natural pseudo-distance than the matching distance. In 
the second place, we show that the lower bound provided by the matching 
distance improves an earlier estimate also based on size functions. The main 
tool to obtain these results is Lemma 30. 

Theorem 32 Let 5 he a distance between reduced size functions, such that 
Kt,M„V^U,,)) < ^Jnf max|^(P) -^(MP))|, 

for any two size pairs {M.,(p) and (A/", V') with M. and J\f homeomorphic. 
Then, 

PROOF. We argue by contradiction. Let us assume that there exist two size 
pairs (f) and (A/", ip), with M. and M homeomorphic, such that 

dm.atch{^\M,<p)i^\N,il})) < '^(^(7W,¥')'-^(Ar,V))- 

By Lemma 30, there exist {M',<p') and (A1',V'') such that ('\M,ip) ~ ^*M',tp')^ 
Of course, ^(at.v-)) = H^*M',<p')^^*M',^p'))- Hence, 



giving a contradiction. □ 



Analogously, we show that the inequality of Th. 29 

is a better bound than the one given in [7], that we restate here for reduced 
size functions: 
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Theorem 33 If there exist {x,y) and (Cv) such that ^*M,<p)i^Ty) > 

V,V')(^'^)' ^^^^ 

heHfMM)p'€M 1*^^^^ ~ V'(M^))I > min{^ -x,y- 77}. 



Indeed, we have the foUowing result. 
Theorem 34 Assume that 

A := {{{x,y), eA+xA+:^>x,7j< y,t^M,^^{x,y) > V,^)!^,^;)} 

is non-empty, and let 

s :— sup {min{^ — x,y — rj}} 

(in other words, s is the best non-negative lower bound we can get for the 
natural pseudo-distance inf/jinaxpex \(fi{P) — ip{h{P))\ by applying Th. 33). 
Then 

dmatch{(-\M,ip)i^*{M,il})) — ^■ 

PROOF. We argue, by contradiction, assuming dmatch{£lM,<p)^^*M,i^)) < 
Since s is a sup, there is a pair ^(x, y), (^, fj)^ E A satisfying 

dmatch{£lM,v)^^U,^)) < min{^ -x,y-f]} =: s' 

with s' < s. 

By Lemma 30 we can construct two size pairs {Ai', (fi') and {Ai', ip') such that 

P* — f* P* — P* nnrl 

dmatch{qM',^'y^lM',^')^ = ^^^Im' ,M') ^M' ''^'^^^ " 

Clearly, the set A coincides with the set 

S := { ( (x, y) , (e, 77) ) e A+ X A+ : e > X, 77 < y , (x, y) > q^,^^,^ (C, 77) } . 



Then \{x,y),{(^,rj)j G B, implying s' < inihUiaxp^ji^i \(p'(P) — 'ijj'(h{P))\, 
because of Th. 33. Finally, observing that 

s' < inf max \cp\P) - ^'(/i(P))| = 

h£H{M',M') PeM' 
— dmatch{qM',tp')T^{M',tl}')) ~ dmatchi^M,'^)-! ,'^)) ^ ^' i 

we obtain a contradiction. □ 
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8 Conclusions 



The main contribution of this paper is the proof that an appropriate distance 
between reduced size functions, based on optimal matching, provides the best, 
stable and easily computable lower bound for the natural pseudo-distance 
between size pairs. Hence, the problem of estimating the dissimilarity between 
size pairs by the natural pseudo-distance can be dealt with by matching the 
points of the representative sequences of reduced size functions. The estimate 
in Th. 29 improves an earlier one given in [7], also based on size functions but 
without the use of cornerpoints. 

The stability of the matching distance between reduced size functions with 
respect to continuous functions is important by itself. Indeed, it allows us to 
use reduced size functions as shape descriptors with the confidence that they 
are robust against perturbations on the data, often arising in real applications 
due to noise or errors. 

A crucial result in our paper is the proof that it is always possible to construct 
two suitable measuring functions on a topological 2-sphere with given reduced 
size functions and a pseudo-distance equaling their matching distance (Lemma 
30). This result has allowed us to prove that the matching distance is the best 
tool to compare reduced size functions. Indeed, after Th. 32, we know that 
it makes no sense to look for different metrics on size functions in order to 
improve the lower bound for the natural pseudo-distance furnished by the 
matching distance. However, it would be interesting to study whether the 
inequality of Theorem 10 in [18], giving a lower bound for the natural pseudo- 
distance via the size homotopy groups, can be improved using appropriate 
analogs of the concepts of cornerpoint and matching distance, in the same 
way that the estimate in Th. 29 improves the result given in [7]. 

References 

[1] A. Brucale, M. d'Amico, M. Ferri, L. Gualandri, and A. Lovato, Size 
functions for image retrieval: A demonstrator on randomly generated curves, in 
Proc. CIVR02, London, M. Lew, N. Sebe, and J. Eakins, eds., vol. 2383 of 
LNCS, Springer- Verlag, 2002, pp. 235-244. 

[2] F. Cagliari, B. D. Fabio, and M. Ferri, One- dimensional reduction of 
multidimensional persistent homology. arXiv:math/0702713vl, 2007. 

[3] A. Cerri, M. Ferri, and D. Giorgi, Retrieval of trademark images by means 
of size functions, Graph. Models, 68 (2006), pp. 451-471. 

[4] D. Cohen-Steiner, H. Edelsbrunner, and J. Harer, Stability of 
persistence diagrams, Discrete Comput. Geom., 37 (2007), pp. 103-120. 



33 



[5] M. d'Amico, a new optimal algorithm for com,puting size function, of shapes, 
in CVPRIP Algorithms III, Proceedings International Conference on Computer 
Vision, Pattern Recognition and Image Processing, 2000, pp. 107-110. 

[6] F. DiBOS, P. Frosini, and D. Pasquignon, The use of size functions for 
comparison of shapes through differential invariants, Journal of Mathematical 
Imaging and Vision, 21 (2004), pp. 107-118. 

[7] P. DONATINI AND P. Frosini, Lower hounds for natural pseudodistances via 
size functions, Archives of Inequalities and Applications, 2 (2004), pp. 1-12. 

[8] , Natural pseudodistances between closed manifolds, Forum Math., 16 

(2004), pp. 695-715. 

[9] , Natural pseudodistances between closed surfaces. Journal of the European 

Mathematical Society, 9 (2007), pp. 231-253. 

[10] P. DoNATiNi, P. Frosini, and C. Land:, Deformation energy for size 
functions, in Proceedings Second International Workshop EMMCVPR'99, E. R. 
Hancock and M. Pelillo, eds., vol. 1654 of Lecture Notes Comput. Sci., 1999, 

pp. 44-53. 

[11] A. Efrat, a. Itai, and M. Katz, Geometry helps in bottleneck matching and 
related problems, Algorithmica, 31 (2001), pp. 1-28. 

[12] P. Frosini, Connections between size functions and critical points. Math. Meth. 
Appl. Sci., 19 (1996), pp. 555-596. 

[13] P. Frosini and C. Landi, New pseudodistances for the size function space, in 
Vision Geometry VI, R. A. Melter, A. Y. Wu, and L. J. Latecki, eds., vol. 3168 
of Proc. SPIE, 1997, pp. 52-60. 

[14] , Size functions and morphological transformations. Acta Appl. Math., 49 

(1997), pp. 85-104. 

[15] , Size theory as a topological tool for computer vision. Pattern Recognition 

and Image Analysis, 9 (1999), pp. 596-603. 

[16] , Size functions and formal series, Appl. Algebra Engrg. Comm. Comput., 

12 (2001), pp. 327-349. 

[17] , Reparametrization invariant norms. arXiv:math/0702094vl, 2007. 

[18] P. Frosini and M. Mulazzani, Size homotopy groups for computation of 
natural size distances. Bull. Belg. Math. Soc, 6 (1999), pp. 455-464. 

[19] P. Frosini and M. Pittore, New methods for reducing size graphs. Intern. 
J. Computer Math., 70 (1999), pp. 505-517. 

[20] R. Garfinkel and M. Rao, The bottleneck transportation problem. Naval 
Res. Logist. Quart., 18 (1971), pp. 465-472. 

[21] M. Handouyaya, D. Ziou, and S. Wang, Sign language recognition using 
moment-based size functions, in Vision Interface 99, Trois-Rivieres, 1999. 



34 



[22] M. W. HiRSCH, Differential topology, Springer- Verlag, 1976. 

[23] K. KURATOWSKI AND A. MoSTOWSKi, Set Theory, North- Holland, 1968. 

[24] L. J. Latecki, R. Melter, and A. G. eds., Special issue: Shape 
representation and similarity for image databases. Pattern Recognition, 35 
(2002), pp. 1-297. 

[25] R. Veltkamp and M. Hagedoorn, State-of-the-art in shape matching, in 
Principles of Visual Information Retrieval, M. Lew, ed., Springer- Verlag, 2001, 
pp. 87-119. 

[26] S. Willard, General topology, Addison- Wesley Publishing Company, 1970. 



35 



